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Abstract 

A linear extension of a poset P is a permutation of the elements of the set that respects the 
partial order. Let L{P) denote the number of linear extensions. It is a #P complete problem 
to determine L{P) exactly for an arbitrary poset, and so randomized approximation algorithms 
that draw randomly from the set of linear extensions are used. In this work, the set of linear 
extensions is embedded in a larger state space with a continuous parameter (3. The introduction of 
a continuous parameter allows for the use of a more efficient method for approximating L{P) called 
TPA. Our primary result is that it is possible to sample from this continuous embedding in time 
that as fast or faster than the best known methods for sampling uniformly from linear extensions. 
For a poset containing n elements, this means we can approximate L{P) to within a factor of 1 + e 
with probability at least 1 — 6 using an expected number of random bits and comparisons in the 
poset which is at most 0(n'^(lnn)(lnL(P))^e~^ ln5~^). 
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1 Introduction 

Consider the set [n] = {1, . . . , n} and a partial order P = ([n], ^) on this set. Then a linear 
extension of the poset P is a permutation a of [n] such that for all i < j, a{i) ^ Say 
that such a permutation respects the partial order. 
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Our goal here is to efficiently count the number of linear extensions, which we denote 
L{P). In general, finding L{P) is a #P complete problem [2], and so instead of an exact 
deterministic method, we develop a randomized approximation method. 

There are many applications of this problem. Morton et al. [9] have shown that a par- 
ticular type of convex rank test for nonparametric models can be reduced to counting linear 
extensions. Many data sets such as athletic competitions or product comparisons do not 
have results for every possible pairing, but instead have an incomplete set of comparisons. 
Counting linear extensions can be used to develop estimates of the actual rank of the items 
involved (see [3].) 

Previous results Previous methods for this problem ([HE]) concentrated on sampling from 
the set of linear extensions where some of the permutation values are fixed ahead of time. 
Generating a single uniform sample from the set of linear extensions takes 0{n^ In n) expected 
number of random bits, using a number of comparisons that is at most the number of random 
bits [6j. Using the self-reducibility method of Jerrum et al. |H], this can be used to estimate 
L{P) to within a factor of 1 + e with probability at least 1 — S in time 0(n^(lnn)^e~^ ln(l/(5)). 

Here we take a different approach. Instead of sampling uniformly from the set of permu- 
tations, a weighted distribution is used that has a parameter /3. The weight assigned to an el- 
ement varies continuously with /3, and this allows us to use a new method for turning samples 
from our weighted distribution into an approximation for L{P) called the Tootsie Pop Algo- 
rithm (TPA). The use of TPA gives us an algorithm that is O {{In L{P))^n^ (In n)e-^\n{l/S)). 
In the worse case, InL(P) is 0{n\nn) and the complexity is the same as the older algorithm, 
however, if L{P) is small compared to n\, this algorithm can be much faster. Even in the 
worst case, the constant hidden by the big-0 notation is much smaller for the new algorithm 
(see Theorem m of Section El) 

Organization In the next section, we describe the self-reducibility method and TPA in 
detail. Section E] illustrates the use of TPA on a simple example, and then Section H] shows 
how it can be used on the linear extensions problem by adding the appropriate weighting. 
Section El then shows how the non-Markovian coupling from the past method introduced 
in |6] can also be used for this new embedding, and Section El collects results concerning 
the running time of the procedure, including an explicit bound on the expected number of 
random bits and comparisons used by the algorithm. 

2 The Tootsie Pop Algorithm 

In [S], Jerrum et al. noted that for self- reducible problems, an algorithm for generating from 
a set could be used to build an approximation algorithm for finding the size of the set. 
Informally, a problem is self-reducible if the set of solutions can be partitioned into the 
solutions of smaller instances of the problem (for precise details, see [8].) 

For example, in linear extensions once a{l) is determined, the problem of drawing 
(t(2), . . . , cr{n) is just a smaller linear extension generation problem. 

While a theoretical tour de force, as a practical matter using self-reducibility to build 
algorithms is difficult. The output of a self-reducibility algorithm is a scaled product of 
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binomials, not the easiest distribution to work with or analyze precisely. 

The Tootsie Pop Algorithm (TPA) [7] is one way to solve this difficulty. Roughly speaking, 
TPA begins with a large set (the shell) containing a smaller set (the center). At each step, 
TPA draws a sample X randomly from the shell, and reduces the shell as much as possible 
while still containing X. The process then repeats, drawing samples and contracting the 
shell. This continues until the sample drawn lands in the center. The number of samples 
drawn before one falls in the center has a Poisson distribution, with parameter equal to the 
natural logarithm of the ratio of the size of the shell to the center. 

To be precise, TPA requires the following ingredients 

(a) A measure space (f2, J^, /i). 

(b) Two finite measurable sets B and B' satisfying B' C B. The set B' is the center and 
B is the shell. 

(c) A family of nested sets {A{f3) : (3 e R} such that (3 < (3' implies A{I3) C A{(3'). Also 
fj.{A{f3)) must be a continuous function of /3, and lim^_i._oo At(^(/3)) = 0. 

(d) Special values /3b and /J^/ that satisfy A(/3b) = B and A{f3B') = B' . 
With these ingredients, TPA can be run as follows. 

Algorithm 2.1 TPA(r, (3b, (3 b') 

Input: Number of runs r, initial index [3bi final index (3b' 
Output: L (estimate of fi{B) / fi{B')) 
1: k^O 

2: for i from 1 to r do 
3: 13 Pb, k i- k - 1 
4: while (3 > (3b' do 

5: k^k + l,X^ KA{P)), 13 ^ inf{/3' G : X G A(/3')} 

6: end while 
7: end for 

8: 1/ ^ exp(/c/r) 



Let A = ln{fi{B) / iJ,[B')) , so that exp(y4) is what we are trying to estimate. Then each 
run through the for loop in the algorithm requires on average A + 1 samples, making the 
total expected number of samples r{A + 1). The value of k in line 7 of the algorithm is 
Poisson distributed with parameter rA. This means that r should be set to about A so that 
fc/r is tightly concentrated around A. 

But we do not know A ahead of time! This leads to the need for a two-phase algorithm. 
In the ffist phase r is set to be large enough to get a rough approximation of A, and then in 
the second phase r is set based on our estimate from the ffist run. That is: 

1. Call TPA with ri = 21n(2/5) to obtain Li, and set Ai = In(Li). 

2. Call TPA with = 2(ii + + 2)[ln(l + e)^ - ln(l + e)^]-^ ln(4/5) to obtain the 
final estimate. 

The result is output L2 that is within a factor of 1 + e of L{P) with probability at least 1 — S. 
This is shown in Section [HI 
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3 Continuous embedding: simple example 



To illustrate TPA versus the basic self-reducibility approach, consider a simple problem that 
will serve as a building block for our algorithm on linear extensions later. In this problem, 
we estimate the size of the set {1,2,..., n} given the ability to draw samples uniformly from 
{1,2,..., b} for any b. 

In the self-reducibility approach, begin by setting /3i = [n/2] and drawing samples from 
{1, . . . ,?T,}. Count how many fall into {1, . . . and use this number di (divided by the 
number of samples) as an estimate of (3i/n. Now repeat, letting (32 = and estimating 

(''2 = (^2/ Pi until (3k = 1. Note that 

E aiOs ■ ■ ■ flfc-i = —- = — . 

n (3i n 

Since the final estimate a of - is the product of A; — 1 estimates, Fishman called this algorithm 
the product estimator [1] . The problem with analyzing the output of the product estimator, 
is that it is the product of k scaled binomials. 

To use TPA on this problem, it needs to be embedded in a continuous setting. Consider 
the state space [0,n]. The family of sets needed for TPA will be [0,/3], where (3b = n and 
(3b' = 1. This makes the ratio of the measure of [0, (3b] to [0, (3b'] equal to n. 

Note that you can draw uniformly from [0, (3] in the following two step fashion. First 
drawX e {1,2,..., \(3]} so that P(X = i) = 1//3 for i < (3 and P(X = (3) = {1 + (3 - \(3])/ (3. 
U X < (3, draw Y uniform on [0, 1], otherwise draw Y uniform on [0,1 + (3 — [/?]]. The final 
draw isW = X-l + Y. 

TPA starts with (3o = n, then draws W as above. The infimum over all /3 such that 
W e [0,(3] is just (3 = W. So (3i just equals W. Next, redraw W from [0, (3i]. Again, the 
infimum of (3 satisfying W G [0, (3] is just W, so (32 equals this new value of W. 

This process repeats until W falls into [0, 1]. The estimate k for Inn is just the number 
of steps needed before the final step into [0, 1]. Note that k can equal if the very first 
step lands in [0, 1]. This random variable k will be Poisson distributed with parameter Inn. 
Recall that the sum of Poisson random variables is also Poisson with parameter equal to the 
sum of the individual parameters, so repeating the process r times and summing the results 
yields a Poisson random variable with parameter rlnn. Dividing by r and exponentiating 
then yields an estimate of n. 

4 Continuous embedding: linear extensions 

This approach can be extended to the problem of linear extensions as follows. A permutation 
can be viewed as a vector in [n]^ space, where if a{i) = j, then a{i') 7^ j for all i' 7^ i. 

In the continuous embedding, our vectors x are in (0, n]^, and if = j, then 7^ 

j for all i' ^ i. Such a vector x induces a unique permutation a by taking the ceiling of all 
its elements, moreover, the Lebesgue measure of the set of x that map to a particular a is 
always 1. 

Therefore the measure of the set of x that map to linear extensions of the poset P is just 
the number of linear extensions. The state space (0,n]"' can be restricted in a continuous 
fashion, leading to the use of TPA. 
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Let (Thome be any valid linear extension. For a vector x, define the distance from x to the 
home position to be: 

dix,ahome) = max \x{i) - ahome(^)l- 

If the distance is 0, then no element i is to the right of the home position. The only way 
that can happen is if = crhomc(0 for all i. Let A{P) be vectors x in (0,n]" that map 

to linear extensions of P and d{x,Xhomc) < /3- Then fj,{A{n)) = L{P) and fi{A{0)) = 1. 

In order to use this family for TPA, it is necessary to draw samples from A{P) for any 
f3 G [0,n]. As in the previous section, a two stage process will be used. In the first stage, a 
linear extension is drawn nonuniformly from the set of linear extensions. Any linear extension 
where a{i) — (Xhomc > /? has zero chance of being drawn. Any i where a{i) — (Xhome = f/?! 
contributes a penalty factor of 1 + /3 — to the probability of S{i). To be precise, let 
P(S = a) = wla)/Z{l3), where 

H^)= l[{{l + /3-\f3^)^a{^)-a^ome^) = \f3^) + l{a{t) - a^ome < F/^D), (1) 

ie[n] 

and Z{l3) = T.„w{a). 

For instance, suppose o"home = (1,2,3,4), and (3 = 1.3. The permutation ai = (3,2,4, 1) 
(if this was a valid linear extension) would have probability .3/Z(1.3) of occurring. Note the 
first component is 2 = [1.3] to the right of its home position, the second component is only 
1 of the right of its home position, and the last two components are to the left of their home 
position. The permutation a2 = (3,4,1,2) would have probability .3^/Z(1.3) of occurring, 
and so on. 

Once the linear extension S has been drawn, draw X as follows. For each i, let X{i) ~ 
Unif((^(i)-l,^(i)] if ^(i)-cThome < , othcrwise let X (z) ~ Unif((^(i)-1, ^(i)+/3- [/?]]). 
This is set up so that the normalizing constants in the uniform for X{i) cancels the 1 + /3— 
factor in the weight of a if a{i) — cxhome = \f3^■ Therefore X is a uniform choice over A{I3). 

5 Sampling from the continuous embedding 

For the continuous embedding to be useful for TPA, it must be possible to sample from the 
set of linear extensions with weights in ([T]). Once the linear extension has been created, 
sampling the continuous version is easy. 

To sample from the set of weighted linear extensions, first build a Markov chain whose 
stationary distribution matches the target distribution. This is done by using the Metropolis- 
Hastings approach with a proposal chain that chooses two adjacent elements uniformly at 
random and transposes them (if such a transposition obeys the partial order) with probability 
1/2. This is encoded in algorithm 15.11 In line 1, Unif denotes the uniform distribution, and 
Bern the Bernoulli distribution. 

From this chain, it is possible to build a method for obtaining samples exactly from the 
target distribution. The method of coupling from the past (CFTP) was developed by Propp 
and Wilson pLOj to draw samples exactly from the stationary distribution of Markov chains. 
For this problem, an extension called non-Markovian CFTP [6j is needed. 
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Algorithm 5.1 ChainStep(o") 

Input: current linear extension Markov chain state a 

Output: next linear extension Markov chain state a 

1: draw i ^ Unit ({1, . . . , n - 1}), Ci ^ Bern(l/2), C2 ^ Bern(l + /3 - [/?]) 

2: if Ci = 1 and not a{i) < a{i + 1) then 

3: if a{i) — crhomc(0 7^ [/?] — 1 or C2 = 1 then 

4: a <r- a{i + 1), a{i + 1) ^ cr(0) ^(0 ^ 

5: end if 

6: end if 



The method works as follows. First, a bounding chain |5j is constructed for the chain in 
question. A bounding chain is an auxiliary chain on the set of subsets of the original state 
space. That is, Abound = 2*^, where VL is the state space of the original chain. Moreover, 
there is a coupling between the original chain [at] and the bounding chain {St} such that 
at evolves according to the kernel of the original bounding chain, and at E St ^ (Xt+i G St+i- 

For us, the state of the bounding chain is indexed by a vector B E {1, . . . ,n, 6}"-. Let 

S{B) = {a : M{{B{j) = z) A {a{f) = i) ^ f < j)} 

For instance, if -8(3) = 4, then a G S{B) requires that a{l) = 4 or a{2) = 4 or a{3) = 4. In 
this setup ^ is a special symbol: if B{i) = 6, then there is no restriction on a whatsoever. 
To visualize what is happening with the state and bounding state, it will be useful to have 
a pictorial representation. For instance, if a = (4,2,3,1) and B = {9, 4,3, 9) this can be 
represented by: 

^|e_2_|4_3_|3j^|e- 

Note that the vertical line with subscript i (|j) always appears to the right of the underlined 
i. This is how the bounding state works, it keeps track of the right most position of the item 
in the underlying state. 

This bounding state has two other properties needed to use non-Markovian CFTP. First, 
there are several states that contains everything, for example B = {9, . . . ,9, crhome(l))- Sec- 
ond, when all the 9 values are gone, there is exactly one underlying state. That state must be 
equal to B. For instance, if 5 = (4, 3, 1, 2), then from the definition of S, the only possible 
underlying state is cr = (4, 3, 1, 2). 

We are now ready to state the procedure for updating the current state and the bounding 
state simultaneously. This operates as in Algorithm 15.21 Note that if the inputs to the 
Algorithm have i ~ Unit ({1, 2, . . . , ra}) and Ci ~ Bern(l/2), then the state a is updated 
using the same probabilities as the previous chain step. The key difference between how a 
and B are updated is that if a{i) = B{i + 1), then B is updated using C3 = 1 —Ci, otherwise 
C3 = Ci- In any case, since Ci ~ Bern(l/2), C3 ~ Bern(l/2) as well. 

Note that a is being updated as in Algorithm 15.11 The only different is the bounding 
state update. Our main result is: 

Theorem 1. If a G S{B), then running one step of Algorithm \5. S\ leaves a G S{B). 

Proof. Since only a{i), a{i + 1), B{i) and B{i + 1) are changing, they are the only states that 
need be considered. There are several cases. To describe the cases more succinctly, use * to 
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Algorithm 5.2 BoundmgChamStep((7, S, Ci, C2) 

Input: current state and bounding state (a. B) 
Output: next state and bounding state {a, B) 
1: C3 ^ (1 - Ci)l(a(i) = B{i + 1)) + Cil(a(z) ^ B{i + 1)) 
2: if Ci = 1 and not a{i) -< a{j) then 
3: if (T{i) - (7home(0 7^ f/S] - 1 Or C2 = 1 then 
4: a ■<— (T(i + 1), (7{i + !)•<— (7(i), (T{i) •<— a 

5: end if 
6: end if 

7: if C3 = 1 and not B{i) < B{j) then 

8: if B{i) - (7home(0 7^ T/^l - 1 or C2 = 1 then 

9: a ^ B{i + l), B{i + l) ^ B{i), B{i) ^ a 

10: end if 

11: end if 

12: if = e then 

13: p ■<— the number of i such that B{i) ^ 9 

14: 5(n) ^ (7home(P+ 1) 

15: end if 



denote a value of o"(i) or a{i + 1) that is neither B{i) nor + 1). For instance, the case 
^|n_h_|6 indicates B{i) = a, B{i + 1) = 6, a{i + 1) = 6, and cr(i) is neither a nor h. These 
cases are given in the table below. 

First note if {a{i), a{i + 1)} n {B{i), B{i + 1)} = 0, then if a{i') ^ B{i) then i' < i and 
switching B{i) and B{i + 1) leaves B{i) bounded. Similarly, if cr{i') = B{i + 1) then i' < i 
and switching B{i) and B{i + 1) leaves B{i + 1) bounded. Hence the only interesting cases 
are when the sets {o'(i), u{i + 1)} and {B{i), B{i + 1)} overlap. These cases are given below. 

Case 1: a \a * \f). For a to move right, it must be true that Ci = 1 and it must be false 
that a ^ a{i + 1). If a — (Thome (0 = F/^l then it is also necessary that C2 = 1. Under these 
conditions, the |a also always moves right, so continues to bound a. 

Case 2: a |„ * \h. That b is bounded just follows from the fact that if cT(i') = b, then 
i' < i. This further implies that it is not true that a ^ b. Hence (just as in Case 1) if a 
swaps, then \a must swap as well. 

Case 3: _a_|a_b_|f,. Since Ci = C3 and the swap for a and b and \a and are determined 
in the same fashion, either they both swap or neither pair swaps. 

Case 4: .a^leJ^U- For ^ to move right, it must be true that Ci = 1, which makes C3 = 0. 
Hence the !„ continues to bound a. 

Case 5: _a^|6j^|a- Same as Case 4. 

Case 6: J^|6»_b_|fe- There are three conditions under which b stays in location i + 1: 1) 
Ci = 0. 2) a{i) ^ b. 3) a{i) — (7home(0 — \P^ ^^<i ^2 = 0. If 1) occurs, then \b does not move 
either. It turns out that 2) cannot occur. To see this, let c be any element that precedes b 
in the partial order. Then it also precedes b in the home permutation Uhome- That means 
that in lines 12 through 15, |c was introduced before the |(, bound was introduced. Since the 
|c bound is always to the left of the |b bound, and is not in position i, it must be somewhere 
to the left of i. Hence c is in a location to the left of i, that is, a{i') — c ^ i' < i. Since c 
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was an arbitrary element that precedes b, situation 2) never occurs. 

Finally, if 3) occurs, then also does not move. Hence the only times b stays put, \b will 
also stay, and lines 2 through 11 maintain the property that a is bounded by B. 

The last lines of the algorithm (12 through 15) deal with the situation where B{n) = 9. 
In this case switching B{n) to any nonzero number will not hurt the bounding property. 
Therefore we switch to the "next" number in line, which is found by using the values in 

order (Thome- □ 

Hence if a is bounded by -B, it will still be bounded after taking one step in the bounding 
chain step. With this established, samples from the target distribution can be generated as 
in Algorithm 15.31 [HI [ID] using non-Markovian CFTP. 

Algorithm 5.3 Generate(t) 

Input: t number of steps to use to generate a sample 
Output: cr drawn from the weighted distribution 

1: cr ^ (Thome, B ^ {9, . . . ,9, (Thomc(l)) 

2: for j from 1 to t do 

3: draw i{j) ^ Unif({l, 1}), Ci{j) ^ Bern(l/2), 

C2(j)^Bern(l + /3-[/31) 
4: ((T, B) ^ BoundingChainStep((T, B, Ci(j), C2{j)) 
5: end for 

6: if for all i, B{i)^9 then 
7: B 
8: else 

9: a <r- Generate(2t) 
10: for j from 1 to t do 

11: ((T, B) ^ BoundingChainStep((j, B, Ci{j), C2{j)) 

12: end for 
13: end if 



6 Analysis 

In this section we prove several results concerning the running time of the procedure outlined 
in the previous section. 

Theorem 2. The non-Markovian coupling from the past in Algorithm 15.51 requires an ex- 
pected number of random bits bounded by A.3n^{lnn){\log2n'\ +3) and a number of compar- 
isons bounded by 8.6n^lnn. 

Theorem 3. For e < 1, the two-phase TPA approach outlined at the end of Section {E 
generates output L2 such that 

+ e)-^ < L2/L{P) <l + e)>l-6. 
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Theorem 4. The expected number of random bits needed to approximate L{P) to within a 
factor ofl + e with probability at least 1 — 6 is bounded above by 



4.3n^(lnn)([log2 +3)[2(A+1) ln(2/(5) + (A+l)(A+3V2A+2)(ln(l+e)'-ln(l+e)^) ln(4/(5)]. 

Proof of Theorem Lemma 10 of [B] showed that when there is no (3 parameter, the ex- 
pected number of steps taken by non-Markovian CFTP was bounded above by 4.3n^lnn. 

So the question is: once the /3 parameter falls below n, does the bound still hold? The 
bound was derived by considering how long it takes for the \g values in the bounding state 
to disappear. Each time a reaches position n, it is removed and replaced by something of 
the form !„. When all the \g disappear, the process in Algorithm 15.31 terminates. 

When there is no P, the probabilities that a particular \g bound moves to the left or the 
right are equal: both l/(2n). (This does not apply when the bound is at position 1, in which 
case the bound cannot move to the left.) The result in [6] is really a bound on the number 
of steps in a simple random walk necessary for the \g bounds to all reach state n. 

Now suppose that (3 E (0,n). The probability that a bound moves to the right is still 

l/(2n), but now consider when the state is of the form . . . \a \e ■ ■ ■■ For to move left 

the \a has to move right, and this could occur with probability {1 + (3 — \(3'])/{2n). That is, 
with P G (0,n), the chance that the \g moves left can be below l/{2n). 

This can only reduce the number of moves necessary for the \g bounds to reach the right 
hand side! That is, the random variable that is the number of steps needed for all the \g 
bounds to reach position n and disappear is dominated by the same random variable for 
/3 = n. Hence the bound obtained by Lemma 10 of [6] still holds. 

Now to the random bits. Drawing a uniform number from {1, . . . , n} takes [log2 n\ bits, 
while drawing from {0, 1} for coin Ci takes one bit. The expected number of bits needed to 
draw a Bernoulli random variable with parameter not equal to 1/2 is two, and so the total 
bits needed for one step of the process (in expectation) is [log2 n] + 3. Each step in the 
bounding chain and state uses at most two comparisons. □ 

It will be helpful in proving Theorem [3] to have the following bound on the tail of the 
Poisson distribution. 

Lemma 1. For X ~ Pois(/i) and a < n, P(X > jj, + a) < exp(-(l/2)aV/U + (l/2)a^//i2) 
and for a < fi, P(X < fi — a) < exp(— a^/(2/i)). 

Proof. These follow from Chernoff Bounds which are essentially Markov's inequality applied 
to the moment generating function of the random variable. The moment generating function 
of X is E[exp(tX)] = exp(^(e* - 1)). So for a > 

F(X > a + a) = P(exp(tX) > exp(t(u + a)) < ^^P^^^j^ ~ . 

exp(t(/i + a)) 

Setting t = ln(l + a/ fi) minimizes the right hand side, and yields: 

P(X > i2 + a) < exp(a - (/i + a) ln(l + a//i)). 
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For a < /i, -ln(l + a//i) < -a//i + (l/2)aV/i^ so P(X > /i + a) < exp(-(l/2)aV/i + 
{l/2)a? / jj?) as desired. For the second result: 

P(X < /i - a) = P(exp(-tX) > exp(-t(/i - a)) < "^""P^^^^ * ~ 



exp(— t(yU — a)) 



Setting t = — ln(l — a/yu) then yields the next result. 
For a < fi, — ln(l — a//i) < a/ fi + (l/2)(a//i)^. So 



-a-(/i-a)ln(l-a//i) < -a + (/i- a) ((a//i) + (1/2) (a/^u)) = -(l/2)(aV/i) - (l/2)(aV/"^)- 
The right hand side is at most — (l/2)a^/yU, which completes the proof. □ 

Proof of Theorem\^ Consider the first phase of the algorithm, where TPA is run with ri = 
21n(2/(5). Consider the probability of the event {Ai + ■\/~Ai + 2 < A\. This event cannot 
happen if A < 2. If A > 2, then this event occurs when Ai<A- (3/2) - ^A - 7/4. Since 
TiAi ~ Pois(ryl), Lemma [T] can be used to say that 

P(riii < riA - ri(3/2 + ^A - 7/4)) < exp(-(l/2)(ri(3/2 + ^ A-7 /A)f /{r^Af 

< exp(-(l/2)ri(9/4 + A- 7/4)/^^) 

< exp(-(l/2)ri/A) 

< 2/6. 

In other words, with probability at least 1 — 6/2, Ai + ^/Al + 2 > A. 

Now consider the second phase. To simplify the notation, let e' = ln(l + e), and A2 = 
exp(L2) where L2 is the output from the second phase. Then from the first phase r2 > 
A{e'^ - e'^)"^ ln(4/5) with probability at least 1 - 6/2. 

So from Lemma [11 

F{r2A2 > r^A + r^e') < exp(-(l/2)(r2e')V(^2^) + (l/2)(r2e')V(^2^)') 

= exp(-(l/2)r2e'VA + {l/2)r2e'yA') 
< exp(-ln(4/5)). 

A similar bound holds for the left tail: 

P(r2i2 < r^A - r2e') < exp(-(l/2)r2e'V(r2A)) < 6/4. 

Therefore, the total probability that failure occurs in either the first phase or the second is 
at most 6/2 + 6/4 + 6/4 = 6. If r2A2 is within additive error r2e' = r2 ln(l + e) of r2A, then 
1/2 = exp(A2/r) is within a factor of 1 + e of exp(y4), showing the result. □ 

To bound the expected running time, the following loose bound on the expected value of 
the square root of a Poisson random variable is useful. 



Lemma 2. For X ^ Pois(/i), E[VX] < 3y/2jl 
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Proof. First note E[VX] = E[VX1{X < 2^)] + E[VX1{X > 2/i)]. Then E[VX1{X < 
2/i)] < y/2jl, while the second term is: 

oo 

E[VX1{X > 2^)] = ViF{X = i). 

The ratio between successive terms in this series is: 

yiTTexp(-/i)(/i)^+V(z + 1)! /i 



< 1/2. 



v^exp(-/i)(/i)V(0! V^i^ + 1) 

The first term in the series is at most y^2jl, so the series sums to at most 2A/2/i. □ 



Proof of Theorem ^ From Theorem [2|, the expected number of bits per sample is bounded 
by A.3n^{lnn){\log2n] + 3) and does not depend on the sample. Hence the total number 
of expected bits can be bounded by the expected number of bits per samples times the 
expected number of samples. The first phase of TPA uses ri = 21n(2/5) runs, each with an 
expectation of A + 1 samples per run to make ri(A + 1) expected samples. The second phase 
uses ra = {Ai + + 2)[ln(l + ef - ln(l + e)^] ln(4/5) runs, where ri^i ~ Pois(riA). So 
from Lemma [21 

Using A = In L{P) and then combining these factors yields the result. □ 



7 Conclusion 

TPA is a sharp improvement on the self-reducibility method of Jerrum et al. for estimating 
the size of a set. At first glance, the continuity requirement of TPA precludes its use for 
discrete problems such as linear extensions. Fortunately, discrete problems can usually be 
embedded in a continuous space to make the use of TPA possible. Here we have shown how 
to accomplish this task in such a way that the time needed to take samples is the same as for 
uniform generation. The result is an algorithm that is much faster at estimating the number 
of linear extensions than previously known algorithms. 
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